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SUMMARY 


OBJECTIVE 

The  objective  of  this  report  is  to  present  a  general  algorithm  for  performing  data  association  or 
“correlation”  for  pairs  of  cumulative  track  data  sets.  Here,  each  pair  represents  potentially  the  same  target 
or  platform  of  interest;  and  the  algorithm  is  required  to  (1)  operate  on  both  linguistic  and  stochastic 
information  and  (2)  be  feasible  to  implement.  This  means  that  in  addition  to  the  now  well-established  use 
of  solely  probabilistic  information— usually  in  the  form  of  geolocation  and/or  onboard  radar 
information— linguistic  or  subjective  attribute  information  is  also  to  be  used— such  as  in  narrative 
descriptions  of  the  form  “appears  to  be  a  type  X  ship  flying  a  dark  flag  and  quite  long  (over  800  feet  or 
so?),  according  to  our  intelligence  reports.” 

A  previous  attempt  in  the  same  direction  —though  apparently  reasonably  successful  as  demonstrated 
through  practical  numerical  ejqjeriments— indeed  had  the  drawback  of  being  replete  with  ad  hoc 
components.  The  resulting  algorithm,  dubbed  “PACT”  (Possibilistic  Approach  to  Correlation  and 
Tracking),  is  fully  documented  in  reference  1.  The  approach  taken  here,  though  preserving  the  main 
structure  of  PACT,  aims  at  reducing  its  empirical  and  ad  hoc  aspects,  as  well  as  its  relatively  long  running 
times. 

RESULTS 

A  detailed  presentation  of  the  derivation  and  basic  properties  of  the  revised  algorithms  is  given  in  this 
report,  followed  by  tables  and  flowcharts  that  explicitly  exhibit  how  to  implement  the  algorithms. 
Unfortunately,  at  the  time  of  the  original  writing  of  this  report,  only  preliminary  numerical  verifications 
were  carried  out.  Apropos  to  the  task  of  final  implementation,  a  group  of  four  individuals  was  organized, 
lead  by  G.  F.  Kramer,  Senior  Analyst;  and  also  comprised  of  this  author;  Dr.  P.  G.  Calabrese,  Senior 
National  Research  Council  Fellow  at  NRaD;  and  Dr.  C.  J.  Funk,  Senior  Analyst,  Code  422.  In  turn,  this 
has  resulted  in  the  NRaD  Independent  Exploratory  Development  supported  TR,  Application  of  the  PACT 
Algorithm  to  Undersea  Surveillance  Data  Fusion. 

In  conclusion,  the  structure  of  PACT  has  been  modified  to  place  it  on  a  firmer  mathematical  basis.  In 
addition,  a  new  alternate,  simplified  approximation  version  has  been  obtained  that  appears  promising  in 
reducing  the  relatively  long  running  time  of  the  previous  version  of  PACT  (reference  1). 

RECOMMENDATIONS 

Further  extensive  numerical  testing  of  these  new  versions  of  PACT  should  be  performed  to 
demonstrate  both  their  eventual  real-world  feasibility  and  improvement  upon  the  traditional  use  of  only 
probabilistic  information. 
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1.  INTRODUCTION 


For  completeness,  the  basic  problem  of  muliilarget  correlation  and  an  outline  of  the  PACT  algorithms 
approach  to  this  are  presented  here.  For  further  details  concerning  the  structure  of  the  original  PACT 
algorithm,  its  implementation,  and  numerical  experiments  and  simulations  involving  PACT,  see  refer¬ 
ence  1. 

The  typical  correlation  problem  involves,  say,  2)  track  histories  that  have  not  yet  been  equated; 
i.e.,  decided  which  truly  represent  distinct  targets  of  interest  or  which  actually  represent  the  same  target  or 
targets  of  noninterest.  The  latter  includes  false  alarms,  whales,  noncombatants,  etc.  Call  each  such  track 
history  trkt,f=  1, .., n ,  which  consists  of  one  or  more  pieces  of  observed  or  reported  data  that  is  assumed 
to  be  categorized  by  four  indices;  f  referring  to  the  parent  track  trkt ,  itself;  T,  for  time;  S,  for  source, 
such  as  specific  electronic  sensor  system  or  human  intelligence  sources;  and  A  for  attribute  or  type  of  data. 
The  latter  can  refer  to  stochastically  oriented  attributes,  including:  A i— geolocation,  such  as  two- 
dimensional  posidons  and  velocity,  as  well  as  estimated  covariance  matrices  or  ellipses  of  errors  associated 
with  them;  A2— characteristics  of  parameters  of  onboard  target  radar  systems  such  as  pulse  repetition 
interval  (PRI),  scan  rate,  and  type  of  system;  and  A3— characteristics  of  some  parucular  nonradar  sensor 
system  onboard  the  target  that  is  reasonably  described  in  a  probabilistic  manner,  among  others.  On  the 
other  hand,  index  A  for  data  can  also  refer  to  linguistically  based  or  subjective  data,  where  the  associated 
numerical  descriptors  are  possibility  or  fuzzy-set  membership  functions.  Examples  of  this  can  include  the 
following:  A4— subjective  classification,  where  the  boundaries  of  the  different  classes  are  unclear  and/or 
many  overlaps  can  occur;  A3— narrative  descriptions  in  natural  language  of  target  maneuvers  or  paths;  and 
A6— narrauves  of  target  characteristics,  such  as  color  of  the  flag,  shape  of  ship,  length;  and  so  on. 

Before  further  expounding  upon,  and  analyzing  fuzzy-set  membership  funcdons  and  their  connections 
with  probability,  let  us  return  to  the  main  statement  of  the  problem.  Summarizing  the  basic  situation,  the 
following  diagram  (table  1-1)  presents  the  scheme  of  information  received. 


Table  1-1.  Scheme  of  information  for  various  track  histories. 


trki 

trki 

trk„ 

consists  of  data, 
each  typically 
written  as 

VA.S.T.l 

where  A,  S,  T 
run  over 
appropriate 
values  relevant 
to  track. 

consists  of  data, 
each  typically 
written  as 
yA.S.T.z 

where  A,  S,  T 
run  over 
appropriate 
values  relevant 
to  track2 

consists  of  data, 
each  typically 
written  as 
yA.S.T.n 

where  A,  S,  T 
run  over 
appropriate 
values  relevant 
tracks 

A  =  Attribute,  S  =  Source,  T  =  Time 


Next,  taking  a  simplified  approach  to  correlation  through  sequential  paired  comparisons  (but,  see 
reference  2,  page  13),  for  comments  on  the  potential  loss  of  accuracy  by  using  this  approach),  consider 
each  possible  distinct  pair,  say  (trkt^ ,  trk(^),  for  1  S  ^  yielding  a  total  of  (2) °  '^such  pairs. 
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Without  loss  of  generality,  denote  as  simply  trki  and  trki^  as  ii'ki,  but  keeping  in  mind  the  actual 
track  numbers  of  subsequent  processing.  In  addition,  for  completeness,  from  now  on,  for  each  fixed 
index  A  and  f,  replace 

data  set  forfri/=  [yA,T,i  •  (A.S’.T)  runs  over  the  possible  combination  of  values  for  trk/)  (1-1) 
by  the  single  updated  best  possible  value  updated  data  set  for  fz-il:, :  {yAjo./}  > 

where  To  refers  to  the  present  time  as  a  reference.  When  A  =  A,,  typically  a  Kalman  filter  or  linear 
regression  estimator  can  be  used  to  produce  yA.Toj.  Similarly,  for  other  stochastic  attributes  such  as  Aj, 
A3,  here.  When  A  is  a  linguistic-based  attribute,  the  process  of  single  data  replacement-updating  can  be 
more  complicated  (or  even  simpler),  depending  upon  the  specific  attribute  in  question.  One  could  use  a 
time  gate  or  majority  filtering  technique  or  some  transition  Markov-like  matrix  approach  in  that  case  for 
the  updating.  For  example. 


>A4,r.3.3  =  brown  (flag  color). 


>A«.T.2.3  =  black, 
yAj,r,i,3  =  brown, 


may  simply  distill  to 


(1.2) 


yA^.i  =  brown. 


(1.3) 


omitting  the  present  time  index  To- 

In  any  case,  a  typical  tableau  of  comparisons  is  shown  in  table  1-2. 


Table  1-2.  Tableau  of  data  comparisons  for  a  pair  of  typical  track  histories. 


Attribute  A 

yA^^  for  trki  ; 

yA^^  for  trkz  : 

Mean  Pos:  H^S'W,  long 
8''2'N  lat 

90%  conf.  ellipse: 
semimajor  axis  Ingth;  0.3  nm 
semiminor  axis  Ingth:  0.5  nm 
Angle  of  inclination:  16® 

Mean  Pos:  14°4'W,  long 
8“4'N,  lat 

90%  conf.  ellipse: 
semimajor  axis  Ingth:  0.5  nm 
semiminor  axis  Ingth:  0.8  nm 
Angle  of  inclination:  -43® 

A4 

A-II  class 

A-III  class 

brown  flag 

dark  brown  flag 

A? 

rather  long  (over  800  feet) 

longish  (over  500  feet) 
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In  addition  to  the  paired  comparison  of  information,  as  presented  typically  in  table  1-2,  one  must 
consider  the  relative  errors  or  reliability  of  the  data  so  that  observed/reputed/predicied  ^  can  be 
compared  numerically  with  any  potential  true  value,  say  2,4^.  ^  ,  /  =  1, 2  ,  /  =  1,  2,  3, m  •  Furthermore,  the 
relative  weight  of  importance  assigned  to  each  attribute  in  the  comparison  process  should  also  be 
ascertained  in  some  sense.  For  example,  how  much  more  or  less  important  is  geolocaiion  (Ai)  matching 
than,  say,  flag  color  (Aj)  matching? 

In  the  past— as  well  documented  in  references  2  and  3— geolocation  information  matching  was  almost 
soiely  emphasized,  with  relatively  little  emphasis  upon  other  attributes,  except  for  course-confirming  or 
course-disconfirming  evidence  use.  However,  with  the  advent  of  AI  techniques,  it  is  becoming  clearer  that 
other  attribute  information  can  play  critical  roles  in  decision  making  in  general,  and  correlation  in 
particular.  (For  example,  see  reference  4  for  employing  fuzzy  logic  in  decisions  useful  in  expert  systems.) 
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2.  THE  PACT  APPROACH 


In  the  philosophy  of  the  PACT  approach,  the  following  three  t>pes  of  information  are  combined: 
observed  data  (suitably  updated) ,  reliability/errors  of  data,  and  relative  weights  of  importance  in  the  form 
of  inference  rules.  In  short,  the  PACT  approach  centers  around  a  generalization— in  order  to  include 
subjective  as  well  as  stochastic  attributes— of  a  conditional  form  of  the  well-known  “total  probability 
theorem."  (For  example,  see  reference  5,  pages  39  and  40): 

P(x|y)=;^P  (;c|y.z)  ■/’(zjy)  ,  (2.1) 

ztD 


where  P  is  a  given  probability,  Jt  is  a  parameter  of  interest,  y  is  observed  data,  and  z  is  an  auxiliary 
variable  introduced  so  that  P(x\y,2)  and  P(z|y)  are  known  legitimate  conditional  probability  functions  of 
arguments  x  and  z,  respectively. 

In  a  purely  probabilistic  context,  the  expansion  in  equation  (2. 1)  completely  utilizes  the  three  types  of 
information  just  noticed:  observed  data  corresponds  to  variable  y,  reliability/errors  of  data  correspond  to 
the  condition  P(z\y),  and  relatitfe  weights  of  importance  correspond  to  the  different  values  for  the 
conditional  P(x\y,z).  Indeed,  the  latter  can  be  thought  of  as  a  model  for  the  inference  rule  connecting  y 
and  2  to  X.  Naturally,  a  different  choice  for  P(jtjy,2_)— depending  upon  the  meaning  of  auxiliary  variable 
2— will  produce  a  different  weighting  scheme. 

Of  course,  it  may  be  feasible  to  obtain  directly,  without  recourse  to  the  introduction  of  auxiliary 
variab'e  z,  the  prior  P(x)  and  the  conditional  form  P(y|x)— especially  when  the  regression  model  holds 


y  =  g(z)+/j(W) 


(2.2) 


for  some  known  (measurement)  function  g  and  known  objective  function  h,  and  where  IV  is  a  random 
variable  (or  random  vector)  statistically  independent  of  x,  representing  measurement/observaiion  error, 
also  with  a  known  probability  function. 

In  the  preceding  case,  it  follows  that  the  conditional  probability  function  for  y  given  x  is  obtainable 
from  equation  (2.2)  as 

P(y|x)=P(W^  =  /i-’(y-g(x)))  .  (2.3) 


In  turn,  Bayes’  Theorem  yields,  in  place  of  equation  (2.1), 

P(xly)  =  P(y|x)  •  P(x)/P(y)  , 

^  (2-4) 

P(y)  =  X  • 

xeQ, 

letting  Do  be  the  (discrete)  domain  of  values  of  A  with  P(y|x)  evaluated  through  equation  (2.3). 

Of  course,  the  Bayes’  Theorem  form  in  equation  (2.4)  is  only  useful  when  P(y|x)  can  be  obtained 
directly.  However,  it  is  often  the  case— and  certainly  so  for  the  main  application  here  to  correlation— that 
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P(y\x)  is  not  readily  obtainable  due  to  the  great  complexity  relating  x  to  y,  and  that  the  appropriate 
introduction  of  auxiliary  variable  z  will  supply  the  needed  link. 

Specifically,  if  x  corresponds  to  correlation  and  y  to  jointly  observed  geolocational  and  radar 
parameters,  by  letting  z  be  the  corresponding  true  values  in  place  of  y,  if  P(x\y,z)  =  Pi.x\z)  and  P(z\y)  are 
known,  equation  (2.1)  is  preferred  to  equation  (2.4). 

Another  issue,  mentioned  earlier,  is  the  desire  to  incorporate  linguistic-based  or  subjective  informa¬ 
tion  with  stochastic  information.  Often,  in  effect,  such  information  involves  a  descriptor  that  does  not 
represent  an  ordinary  set,  but  rather  a  fuzzy  set  A  determined  by  a  corresponding  membership  function 
<t>{A)  :  Da -*  [0,  1] ,  where  for  any  xt  Da.  0(A) (x)  is  interpreted  as  “the  degree  to  which  x  belongs  to  A." 
Examples  of  this  include  A  =  “tall,"  “long,”  “happy,”  “cold;”  and  where  attribute  domain  of  values  Da 
can  be  a  fixed  population  of  people  or  ships.  On  the  other  hand,  one  can  in  a  sense,  reverse  the  roles  of  A 
and  Da  by  choosing,  e.g.,  A  to  represent  "typical  ship”  and  Da  to  consist  of  a  set  of  classifications 
{Cu...  Cm }  so  that  the  evaluation  0(A)  (Cy)  means  the  degree  to  which  Cj  belongs  to  A,  i.e.,  to  which  a 
typical  ship  (prior)  is  of  Cy  type.  (See,  for  example,  Dubas  and  Pride  [reference  6]  for  background 
information  on  fuzzy  sets.) 

Further  details  and  analysis  of  fuzzy  sets  representing  linguistic  descriptions  are  given  in  the  following 
chapter.  In  any  case,  the  main  thrust  of  PACT  in  utilizing  both  linguistic-based  and  probability 
information  is,  first,  to  extend  the  purely  probabilistic  expansion  in  equation  (2.1)  to  a  fuzzy-set  form  for 
each  inference  rule  used,  and  second,  to  combine  the  resulting  posterior  descriptions  of  correlation  for 
each  inference  rule  used  into  an  overall  posterior  description. 

The  chief  motivation  for  the  current  revision  of  PACT  is  to  reduce  the  empirical  or  ad  hoc  aspects 
relative  to  the  choice  of  logical  operators— especially  the  use  of  material  implication  in  fuzzy-set  form  for 
modeling  conditioning  in  inference  rules  and  the  combining  of  inference  rules.  In  addition,  a  new 
upper-bound  approximation  for  PACT  will  be  presented,  potentially  greatly  reducing  PACT’S  relatively 
long  running  times.  Before  obtaining  these  results,  a  fresh  discussion  is  presented  motivating 
the  encompassing  use  of  fuzzy  sets  relative  to  information  that  is  in  part  linguistic-based  and,  in 
part,  stochastic. 
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3.  FUZZY  SETS  AND  PROBABILITY:  PRELIMINARIES 


Let  F;R  -♦[0,1]  be  a  given  function.  Then,  recall  from  elementary  probability  theory  that  letting 
Df  =  {x  ;  xeR  A  F{x  +  A)  -  F{x)  >  0  ,  for  all  A  >  0}  . 


be  the  support  of  F,  then  F  is  a  cumulative  probability  distribution  (cdf)  over  Df  if  F  satisfies: 
(i)  F  is  nondecreasing  continuous  from  the  right. 


(ii)  lim  F(a:)  =  0  .  Urn  F{x)  =  1  . 

X  — -•  ,  X— ♦«  . 

xtDf  xtDf 


Note  that  if  Df  is  finite,  say, 


Df=  {jti, ...  Jtfl}  with  xi  <X2  <  ...  <Xn  .  (3.3) 


xyeR  ,;■=  1,  then  defining/:  Df-*  (0. 1]  by 


f(xj)  i  F(xj)  -  F(xj.i)  >  0.  ;•  =  1, , n  .  (3.4) 


with  the  convention 


jto  =  -  »  ,  whence  F{xo)  =  0  , 


(3.5) 


implying 

/(x.)  =  F(x,)  ,  (3.6) 


so  that 


n 

0  S  f(xj)  Si,  ;■  =  1, .  ,  n  ,  ^  f{xj)  =  1  , 


(3.7) 


establishing/  as  a  probability  function  associated  with  F.  As  an  interpretation:  One  and  only  one  outcome 
XjtDf  can  occur  at  a  time,  with  probability  /(xj). 
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Returning  to  the  general  case  for  Df.  associated  also  with  F  is  the  nested  (or  level)  random  set 


5(F.  U)  =  F-'IU,  1]  =  { jc  :  xeR  A  F(x)  >  U  }  (3.8) 

where  U  is  any  random  variable  (r.v.)  that  is  uniformly  distributed  over  [0,1].  Note  that  one  has  the  basic 
one-point  coverage  relation 

Pr  (X  e  S(F.  [/))=  Fr  (U  <.  Fix))  ^3 

=  Fix)  ,  all  xfR . 

Indeed,  making  use  of  the  basic  properties  of  pseudoinverses  denoted  by  (  )  ^  (see,  for  example, 
Goodman  and  Ngugen  [reference  7],  pp.  121-124),  one  can  show,  slightly  abusing  notation,  that 


5(F,ty)=  [F^(C/),cH  . 


(3.10) 


actually  a  nested  random  interval  with  constant  fixed  right  end  point  Cf  ,  where 


Cf  =  F-'(l)nDf 


(3.11) 


and  random  variable  F^i^)  for  its  left  end  point,  noting  r.v. 

Ft((/)=infF->(i/.l]  (3J2) 

=  inf{x.-xeR  <t  (/  S  F(x)}  , 
has  precisely  cdf  being  F  itself! 

d 

Conversely,  if  V  is  any  r.v.  over  some  Dp  Q  R,  with  cdf  F,  say,  and  cp  =  sup  iDp),  then  the  random 
interval  [V,  Cf]  can  be  naturally  identified  with  F  via  the  one-point  coverage  relation: 


Prix€[V.CF])  =PriV^  x)  =  F(x)  . 


(3.13) 


for  all  xeDp- 

One  can  also  associate  directly  with  the  probability  function  /  for  F  when  Dp  is  finite,  say  as  in 
equation  (3.7),  the  random  nested  set 


Sif.U)=r'[U,  1] 

=  {x  ;  xeDp  <t  fix)  S  U] 

Note,  again,  the  one-point  coverage  relation 


(3.14) 


PrixeSif,  U))  =/(x),  all  xeDp  , 


(3.15) 
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and  most  importantly,  since  /  is  a  probability  function  (see  equation  3-7), 
1  =:^Pr(xeS(f,l/))  ■ 

X,  Dp 


(3.16) 


The  construction;,  in  equations  (3.11)  through  (3.16)  are  all  special  cases  of  the  relations  between 
fuzzy  sets  and  random  sets  pointed  out  originally  in  references  8  and  9;  and  later  in  reference  7, 
Chapters  3  and  4. 

Fuzzy  sets  and  their  operators  provide  a  way  to  represent,  relatively  simply,  numerical  uncertainties  of 
possibly  overlapping  complex  events  (references  6  and  7).  The  following  examples  and  analysis  should 
provide  a  good  basis  for  illustrating  this  point  and  relating  fuzzy  sets  to  probability  concepts: 

Suppose  an  adjectival  or  descriptor  attribute  A  is  given  with  an  associated  numerical  (unction 
<p(A)  :  Dji  -*  (0,  1] .  (At  times  for  convenience  —  when  not  confusing  —  we  can  simplify  previous  domain 
notation.)  To  begin  with,  A  could  represent  the  abstraction  corresponding  to  any  given  cdf  F.R  -►  [0,  1] 
with  support  ^F,  so  that  ^(A)  =F.  One  could  label  A  here  as  “F-ish”  or  simply  “F,"  and  so  on.  ^(A) 
frequently  does  not  behave  like  a  cdf  or  probability  function.  Indeed,  if  A  is  first  given  as  a  linguistic 
descriptor  or  adjective  relative  to  some  appropriate  domain  and  then  a  suitable  associated  numerical- 
valued  function  ^(A)  .Da-*  [0. 1]  is  sought  for  the  interpretation,  for  each  xcDa: 


4>{A)(x)  s “degree  to  which  x  has  property  A" 

=  “membership  level  of  x  in  fuzzy  set  A”  (3.17) 

=  “possibility  of  A  occurring  at  x” 

Typical  examples  include  the  descriptors: 

A)  =  happy,  DA^  =  adult  males  living  in  Denmark  (3. 18) 

A2=long,  Da^  =  1100',  1500']  (3.19) 

representing  possible  ship  lengths, 

A3  =  dark,  Daj  =  spectrum  of  colors  (3.20) 

A4  =  cIose,  =R*  (3.21) 

I  ' 

I  representing  distance  between  targets, 

As  =  typical  prior  target,  Daj  ~  (Ci, ...  Cio}  ,  (3.22) 


where  Cy  represents  the  class  of  ship,  where,  in  general,  the  Cy’s  are  overlapping.  Note  the  effective 
reversal  in  As  where  the  domain  values  play  the  role  of  the  descriptor  and  the  attribute  is  a  relatively 
neutral  typical  population  element. 

In  a  similar  vein,  one  can  make  a  fuzzy  set  interpretation  of  cdfs  and  pf's,  noting  again  the  role  of  the 
descriptors  being  determined  by  the  domain  values,  rather  than  by  given  prior  atiritubes. 
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Even  if  in  any  natural  interpretation  of  an  attribute  A,  4>{A)  :  -*  (0,  1]  is  nondecreasing— and 

hence  formally  similar  to  a  cdf— the  interpretation  of  <l>iAi)ix)  can  be  quite  different  than  if 
<^{A)  represented  a  cdf.  For  example,  ^(Aj)  has  the  form  of  a  cdf,  and,  hence,  relative  large  values  of 
X€Dj\,  yield  large  values  for  ^(A2)(jf).  as  should  be,  considering  the  meaning  of  “long."  On  the  other 
hand,  one  can  question  how  useful  is  the  attribute  Ae  defined  to  correspond  to  cdf  <p(.Ai)  :r2-v  [0,  1]  to 
describe  a  target’s  location-  Certainly,  for  jteR^  large  positive  in  both  components,  4>{A(,){x)  is  close  to  1, 
but  x  can  be  quite  far  from  the  mean  of  the  distribution.  In  this  case,  the  local  attribute,  i.e.,  the 
discretized  probability  function  corresponding  to  <t>(A(,),  is  a  more  meaningful  numerical  measure. 


10 


4.  FUZZY  SETS  AND  PROBABILITY:  GENERAL  CASE 


In  any  case,  one  is  led  to  the  following  general  random-set  interpretation  for  fuzzy  sets  extending  the 
material  just  covered  in  Chapter  3. 

For  any  given  attribute  A  with  associated  possibility  or  fuzzy  set  membership  function 
0(A)  :  D,4 -►  [0,  1] ,  define  the  associated  (unique)  nested  random  set  S{A,U)QDa  analogous  to 
equation  (3.8)  or  equation  (3.14): 


S(A,(/)  =0(A)-M(/.  1] 

=  {x  :  xt  Da  and  U  <  0(A)  (x)}, 


(4.1) 


U  any  r.v.  uniform  [0,1]  distributed. 

It  then  follows  immediately  that  for  all  ^^Da,  the  one-point  coverage  relation  holds 

Pr(xfS(A.y))  =  Pt(I;  <  0(A)  (x)) 

=  0(A)(x)  . 


(4.2) 


When  again  Da  is  finite,  say  then  equation  (4.2)  can  be  interpreted  also  as  identifying  the 

possibility  of  x  occurring  relative  to  A  as  the  probability  of  the  filter  class  or  class  of  interacting  subsets  of 
Da  relative  to  x,  x  ,  occurring,  where 

x={a:x€  a  Q  Da}  (4  3) 


Thus, 


0(A)(x)  =Pr(S(A,t/)e  x) 

=  2  Pr(S(A,  CO  =  a)  (4.4) 

ae  * 

=  Pr{Or  (a  occurs))  , 
ae  X 

summarizing  the  fundamental  relation  between  attribute  possibility  outcomes,  or  equivalently  fuzzy  set 
membership  function  levels,  and  probabilities  of  interactive  sets. 

Note,  also,  that  any  typical  outcome  S(A,l/)C  Da  can  be  thought  of  as  all  elements  of  S{A,U) 
occurring  simultaneously. 

Note  the  following  three  special  cases  for  S(A,  U),  depending  upon  the  structure  of 
0(A)  -.Da-*  [0,  1],  slightly  abusing  notation  {Da  need  not  be  finite): 

(i)  0(A)  nondecreasing  (but  not  necessarily  a  cdf). 

S(A,C/;=[0(A)»(CO,cm]  .  (4.5) 

where  ca  is  determined  by  the  relation 

sup  0(A)  (X)  =0(A)(Ca)  ■  ,4  gx 

xcDa 
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(ii)  0(A)  nonincreasing. 

Defining  A'  also  over  Da  via 
1-0(A) 

S(A,U)=  Da-S(A',1-U) 

=  {dA.<t>{A')\\-U)\ 

=  ldA,<l>iA)HU)]  . 

where  dA  *s  determined  by  the  relation 

inf  0(A)  (x)  =0(A)(dyt)  . 

X€Da 

(iii)  Da  =  {xi,  ..,x„} 

S(A,U)=  (xj  :  xjcDa  &  4>(A)ixj)  S  f/}  . 
noting  if 


(4.7) 


(4.8) 


(4.9) 


(4.10) 


0  S  0(A) (acy)  <  ..  <0(A)(jt„)  S  1  , 
then 

Pr(S(A,  U)={xj.Xj,x.  ...x„})  *  0(A) (jcy)  -0(A) (Jty.,)  , 

lot  j  ~  I,  n,  with  the  convention  -  using  a  fictitious  JCo  and^^.^  - 


0(A)  (xo)  =  0,  0(A)  (x„+,)  =  1  , 


(4.13) 


so  that 


Pr{S{A,U)=ff)  =l-0(A)(jt„). 
PrlS{A,  C/)  —  (xi, . X/i})  —  0(A)  (.Ti) 


(4.14) 


Next,  note  that  there  can  be  many  distinct  random-set  representations  in  general  of  any  given  fuizy 
set  membership  function.  For  example,  if  0(A)  :Dy,-*  [0. 1]  is  such  that  =  {x,. ...  x«} ,  then  the 
random  set  r(A)  Q  Da  also  represents  0(A)  where  for  any  subset  a  Q  Da 

Pr{T{A)  =  a)  i  (4- 15) 
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T(A)  is  the  maximal  enuopy  representative,  while  S(A,U  )  has  minimal  entropy-like  propenies— in  any 
case,  very  different  in  form  (see,  for  example,  reference  7,  especially  section  5A).  Conversely,  any  ran¬ 
dom-subset  5  £  D/t ,  say,  determines  a  unique  one-point  coverage  equivalent  fuzzy  sci  membership  func¬ 
tion,  namely 


^(A)(x)  = /’r(x6S),  xcDa, 


(4.16) 


Thus,  with  the  plethora  of  possible  (one-point  coverage)  random-set  representations  for  a  given  fuzzy 
set,  which  one(s)  should  we  choose  and  how  does  this  choice  relate  this  to  established  fuzzy  set  logical 
operators  such  as  Zadeh's  proposed  min— or  product  at  times— for  intersection,  max— or  probability  sum 
(as  follows)  for  union,  and  l-(  )  for  complement,  etc.? 

In  response  to  the  preceding  discussion,  note  that  the  nested  random-set  representation  for 
attributes  A  referring  to  cdf’s  reduce  nicely  to  the  natural  fixed  right-hand  constant,  left-hand  r.v.  internal 
case  as  in  equations  (3.8)  through  (3.13).  For  the  other  extreme  cases,  such  as  A  corresponding  to  <t>{A) 
being  a  nonincreasing  membership  function  or  to  a  legitimate  probability  function,  also  naturally 
corresponding  forms  held  for  the  nested  random-set  representations.  (Again,  see  equations  (4.7)  through 
(4.9)  and  (3.4)  through  (3.6). 

As  a  consequence  of  the  above,  as  well  as  for  ease  of  interpreting  logical  operators  over  fuzzy  sets 
through  ordinary  corresponding  ones— as  will  be  seen  in  the  next  chapter— the  nested  random-set 
representations  will  be  considered  from  now  on  as  the  natural  probabilistic  interpretation  of  fuzzy  sets. 
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5.  FUZZY  SETS  AND  PROBABILITY:  OPERATOR  RELATIONS 


Taking  the  natural  nested  random  set  representation  of  fuzzy  sets  (see  also  Goodman  (reference  10] 
for  further  justifications  in  terms  of  flou  sets  and  extensions  of  the  Negoita-Ralescu  representations), 
consider  the  following  development,  based  upon  the  concept  of  copulas,  i.e.,  cdf’s  for  n  r.v.’s  t/i, U„ 
each  distributed  unif-[0,l]  (see  reference  7,  section  2.3.6,  or  Schweizer  and  Sklar  [reference  11]  for 
background) . 

Roughly  speaking,  copulas  can  be  considered  to  be  natural  generalizations  of  ordinary  two-valued 
conjunction.  Similar  comments  are  valid  for  cocopulas— i.e.,  DeMorgan  and  related  transforms  of  copulas 
representing  generalizations  of  ordinary  disjunction.  Examples  of  copulas  and  cocopulas  are  (min,  max), 
(prod,  probsum)  where 

probsum  (ti,..,r„)=  1-  n  (1-/;),  /yf[0, 1],  j=l....n.  (5.1) 


Another  copula,  cocopula  pair,  among  an  infinitude  of  possibilities,  for  n=2,  is  (maxbnd,  minbnd) 
where 


maxbnd  (ti,/2)  =  inax  (f,+/2-l,0)  ,  (5.2) 

minbnd  =  min  (ti  +  tz,  1)  ,  (5-3) 

for  all  tyf  (0, 1], 

Then,  denoting  generically  cop:  [0, 1]"  -♦  [0, 1]  for  the  (joint-n)  cdf  of  i/j, ..,  Un ,  for  any  attributes  Aj 
with  domain  D/^j,  js  1,  ..,n,  define  conjunction,  A^  A  A„,  disjunction  AjV  •  •  v  An  and  negation 
A'j  through  the  corresponding  membership  functions,  where 0(Ai  ’  •  d  A„):Da^  x  •  •  x  Da„  -*  (0,  1] 
is  given  by 

<l>{Ai&- ’  An)iXx.  ..,Xn)  =  COp(^(Ai) (x,), ^(A„)(xn))  . 

and  where  0(Ai  V  •  •  V  An)  :  x  •  •  x  Da„  -*■  [0, 1]  is  given  by 

^(Ai  V  •  •  V  An)(Xi,  ..,X„)  =  COCOp(^(Ai)(jt,),  ..,0(An(Xn)) 

=  2(-  •  cop{((t>{Aj){xj)),jtK) 

and 

4>iA!j(,Xj))i  \-^{Aj){xj)  , 

lot  a\\  XjeDAj  y=l,...,n 

Sklar’s  Theorem  (reference  11)  is  compatible  with  the  result  that  0(Ai  4  •  •  <t  An)is  a  legitimate  cdf 
over  DA■^'X  '  '  ^  Da„  if  each  4>(Aj)  is  a  cdf  over  Daj€R,  j=l,...,n.  Conversely,  any  cdf  F  over 
Dai  X  •  •  X  Da„  Cr"  can  be  expressed  as 


(5.4) 


(5.5) 


(5.6) 
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f  =  Fi&  •  •  &Fn  -  COpo(F,, ...  Fn)  . 


(5.7) 


where  Fj  is  the  marginal  cdf  of  F,  j=l,  ...  n. 

In  addition,  the  basic  homomorphic  tie-in  between  <t,  V ,  (  )‘  relative  to  all  fuzzy  sets  and  ordinary 
n,  U,  (  )'  relative  to  nested  random  sets  can  be  established: 


For  all  XjcDAjJ  =  1, ...  n.  by  the  definition  of  cop 

4>iAi  &  A„)(x, . x„)=Pr((Ui  S  0(A,)(jc,))  4  ■  •  <4  ((/„  <  0(A„)(j,))) 

=  /’r((x,£.S(A,.f/,))  A  •  •  d  (x„  c  S(A„,  U„))) 

Hence,  when  •  ‘  ~Da„  =  D,  for  all  xeD  (equation  5.8)  implies 

0(Ain-  •  nA„)(x)  =^(Ai  •  •  <4  A„)(x.  ...x) 

=  Pr(xeS(Au  C/,)  n  •  •  nS(A„.  t/„))  . 


(5.8) 


(5.9) 


For  all  X;  e  Z>x..;=  1. •••.«.  by  reference  11,  the  definition  of  cop  and  the  basic  Poincare  alternate 
sum  expansion  of  probabilities, 

^(A,  V  •  V  An)  (x„ . . . x„)  =  Pri  dc  (Uj  S  0(Ay)  (xj))) 


*  Priv^  (Uj  S  ^(Ay)(xy))) 

=  Pr((xie5(A|,  f/i))  V  •  •  V  (xncS  (An,  t/n))) 
Hence,  when  Da^  =  ’  '  ~  Da„  =  D,  for  all  xeD  (equation  5.10)  implies 

d 

0(AiU  •  •  UA„)(x)  =0(Ai  V  •  •  V  An)(x....x) 

=  Pr(xeS(Ai.  t/i)  U  •  •  U5(A„.f/n)) 

Finally,  it  is  obvious  that  for  all  xj  €  Daj  , 

^(Aj(x;))  =  l~Pr(xjeS{Aj,  Uj)) 

=  Frixj€DAj-*  S(Aj,  Uj)) 

^PrixjtSiAj,  \-Uj))  . 


(5.10) 


(5.11) 


(5.12) 


Equations  (5.4)  through  (5.12),  together  with  the  previous  justification  for  the  use  of  unconditional 
fuzzy  sets,  logical  operators  in  the  form  of  copulas,  cocopulas  and  1~(  ),  model  linguistic-based  attributes 
and  other  logical  connections,  compatible  with  probability  theory. 
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6.  CONDITIONAL  FUZZY  SETS 


Finally,  consider  the  modeling  of  conditional  attributes.  Reference  10,  section  9,  shows  that  if  A.x,Ai 
are  two  given  attributes,  with  any  corresponding  domains  then  the  conditional  attribute 

(A1IA2)— read  “Aj  given  A2”— should  be  characterized  for  fixed  copula,  through  the  membership 
function  ^(Ai|A2)  :  Oa,  x  [0.  1],  where  for  any  xj  cDaj  ,  y  =  1, 2, 


0(A,  A  A2)(j:,.X2)MA2)(x2), 


^(Ai|A2)(Xi,X2)  = 


provided  0(A2)  (X2)  >  0  . 

10,1] 

provided  ^(A2)  (X2)  =  0  , 


(6.1) 


where  <p(Ai  A  A2)  is  given  in  equation  (5.8)  for  n=2.  Clearly,  equation  (6.1)  reduces  to  a  version  of 
ordinary  conditional  cdf’s  when  0(Ai)  and  ^(A2)  are  cdf’s. 

In  turn,  one  can  develop  an  entire  calculus  of  logical  operations  extending  A,  v,  (  )'  —and  hence 
n,  U,  -  as  well— from  the  unconditional  case  to  the  more  general  conditional  one.  This  is  based  on  the 
calculus  of  conditional  logical  operations  relative  to  nonfuzzy  or  ordinary  conditional  events.  For 
background  on  conditional  event  algebra,  see,  for  example,  the  recent  summaries,  reference  10, 
section  6;  reference  12,  or  the  forthcoming  monograph,  reference  13.  In  brief,  if  R  is  a  Boolean  algebra 
of  ordinary  sets  or  equivalently  unconditioned  events  or  propositions,  then  for  all  a,  b,  c,  d  e  R,  the 
following  extensions  of  the  ordinary  Boolean  logical  operation  A  (or  n),  v  (or  n),  and(  )’(or  -)  hold: 


{a\b)  A  (c|d)  =  (abcd\a'b  v  c'd  v  bd)  , 


(a\b)  V  (c|d)  =  (ab  v  cd\ab  cd  y  bd)  ,  ^ 


(a\by  =  ia’\b)  , 


(6.2) 


where  for  shorthand 


abed -a  itb  i-c  &.d  , 
a'b  ^a'  it  b  , 


(6.3) 


and  so  on. 

In  turn,  as  developed  in  reference  10,  section  9,  one  can  use  equation  (6.2)  to  determine  in  a  natural 
way  the  further  extensions  to  conditional  fuzzy  sets. 

This  is  accomplished  through  use  of  the  nested  random-set  representation  of  the  conditional  fuzzy  sets 
and  their  application  of  equation  (6.2).  For  example,  for  any  attributes  Ay  with  domains  Daj,  xj  cDaj 
y=l,2,3,4,  with  a  fbted  choice  of  copula  cop,  letting 


17 


Ofixj)  i  ixjeS{Aj,  Uj))  =  (Uj  ^  4>iAj){xj)) 

=  Uj^lO,^(Aj){xj)]  (boolean)  , 

y=l,2,3,4,  one  has 

^((^lilAz)  V  (A3|A4))(j:i.;t2.;c3.j:4) 

=  /’'■((ai(;ci)|a2(x2))  v  (Q3(-<3)|Q(4(Jt4)))  (6.4) 

=  Pr(ao\fio)  . 

Oo=ai(jCl)a2(jC2)  V  0!j(X3)Q(4(JC4)  . 

(6.5) 

^o  =  ao  V  (a2(Jt2)  qaCxa))  • 

In  turn,  equation  (6.5)  can  be  used  to  evaluate  equation  (6.4)  fully.  (Again,  see  reference  10,  section  9, 
for  further  details.)  Of  course,  one  could  utilize  in  place  of  the  above  development,  a  more  simplified 
approach,  where,  for  example,  V  as  in  equations  (6.4)  and  (6.S)  is  approached  the  same  formally  as  in  the 
unconditional  case;  for  example,  equation  (S.5). 

Finally,  note  that  based  on  equation  (6.1),  one  can  show  (see  reference  10,  equations  (8.42)  and 
(8.43))  that  if  cop  is  associative  and  distributive  and  h  :  Z  -*■  [0, 1]  is  any  auxiliary  function  so  that,  using 
V  to  indicate  max  or  sup, 

max  h(2)  s  1  ,  (6.6) 

uZ 

Then,  for  all  xeX,  yeY,  the  analog  of  equation  (2.1)  holds,  that  we  call  the  PACT  expansion: 

(f(x)\g(y))^  max  (/■(j:)|  JCy,z))  *  (/«(z)U(y))  (6.7) 

« Z  ' 

where 

J  (y,z)  ^ cop(/i(z),g(y))  ,  (6.8) 

for  all  yc  T,  zeZ 
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7.  DEVELOPMENT  OF  MODIFIED  PACT:  MODIFIERS  AND 
ATTRIBUTE-MATCHING  FUNCTIONS 


With  ail  of  the  preliminaries  in  place,  we  can  establish  the  new  modified  PACT  algorithm.  In  the 
former  version,  a  number  of  ad  hoc  assumptions  were  made,  including: 

(1)  combining  all  inference  rules  beforehand  into  one  joint  rule  using  an  arbitrary  choice  for 
conjunction 

(2)  identification  of  conditioning  with  a  fuzzy-set  version  of  material  implication 

(3)  ad  hoc  forms  for  matching  functions  used  to  construct  the  inference  rules. 

In  addition,  PACT  also  had  the  drawback  of  requiring  excessively  long-running  times  due  to  the  required 
disjuncdve  iteration  of  variables  running  over  the  auxiliary  attribute  domains.  (Again,  see  reference  13  for 
further  details.) 

In  brief,  the  modified  version  of  PACT  here  addresses  (1)  by  using  only  one  inference  rule  (f\ })  in 
equation  (6.7)  at  a  time,  followed  by  combining  the  outputs.  In  (2),  conditioning  is  assumed  to  be  of  the 
above  form  (J\ }),  rather  than  in  material  implication  form.  However,  later,  when  an  abbreviated  version 
of  PACT  is  sought,  conditioning  is  replaced  by  a  more  expedient  form.  For  (3),  a  matching  function  is 
considered  simply  a  joint  fuzzy-set  membership  function  of  the  same  marginal  attribute  connected  by  a 
copula.  Finally,  a  simplified  substitution  and  approximation  relative  to  the  PACT  expansion  is  shown  to 
lead  to  a  greatly  reduced  running-time  version  of  PACT. 

Apropos  to  the  fuzzy-set  version  of  equation  (1): 

Let  variable  ze  (0, 1]  represent  the  true  level  of  overall  track  association  or  “correlation"  of  two  given 
apparent  targets  of  interest  that  may  or  may  not  actually  represent  the  same  target.  Let  attribute  Ao 
correspond  to  correlation  so  that  <p{Ao)  ■  (0, 1]  -*  (0, 1]  represents  its  membership  function,  which  may 
well  be  simply  the  identity 

Introduce  auxiliary  attributes  and  variables  that  provide  potential  information  about  Ao  conditioned 
on  any  observed  attribute  values:  Ai.Ai,  ...,Am.  For  example,  one  could  have: 


A]  =  geo2  (two-dimensional  locations) 


(7.1) 


with  4^(Ai)  :  Dai  -*  [0,  1],  where  Da^  £  R2  is  some  finite  domain  with  0(Ai)  a  suitable  discretized- 
truncated  version  of  Gaussian  distribution  Ai),;iie  R2  mean  and  Aj,  2  by  2  positive  definite  covari¬ 

ance  matrix  of  error,  so  that  typically 


<t>{Ai)(zi)  =kuexp~Xn(zi-/Xi)'^A''(zi-fii). 


for  some  positive  scaling  factors  A,i,Ai2:  An  ^  1.  Or,  one  could  have 


(7.2) 


A2  =  color  (dominant  flag  color  of  enemy) 


(7.3) 
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with  associated  membership  function  given  numerically  as 


1 

^(AzXzz) 

0.5 

• 

—  • 

0 

• 

• 

1  .  1  1  - 1  ■  1 - 

YEL  WH  OR  BR  BL 


(7.4) 


In  general,  ip(Ai)  :  Da  -  -*  [0,  1]  is  assumed  known  from  prior  information  with  Dt  some  domain  of 
values  for  attribute  A,-.  /=l,  ...  m. 

Next,  suppose  a  range  of  modifiers  has  been  established  with  associated  membership  functions,  so  that 
modifier  j,  mody  has  associated  membership  function  ^(mody)  :  [0, 1]  [0.  1].  A  simple  collection  of 

such  modifiers  would  correspond  to  the  linguistic  and  numerical  interpretation  as  follows: 


very  high 

modi 

0(modi)(r)  =/“!, 

t€[0.  1] 

high 

mod2 

0(mod2)  (r)  =  /"J, 

fe(0, 1) 

moderately  high 

mod3 

^(mod3)(0  =<‘’3, 

t€{0,  1) 

medium 

mod4 

0(mod4)(O  =t“*, 

feI0,l] 

low  medium 

mods 

^(modsX/)  =<"3, 

t€  [0,  1] 

low 

mods 

** 

0(mod6)(/)  =  t“6, 

t€  [0,1] 

very  low 

modv 

** 

0(mod7)(t)  = 

t€  [0, 1] 

where  for  suitably  chosen  values  the  exponents  order  as 

0(1  >02  >0(3  >0(4  =  1  >as  >a«  >07  >  0  .  (7.6) 

Note  also  the  typical  interpretations 

not  high  ♦*  1  -  mod2  ♦♦  1  -  ,  (7.7) 

etc. 

Next,  define  the  basic  matching  attributes  match  (Aj)  as  follows: 

Pick  a  copula  copj :  [0, 1]^  -*  [0, 1]  that  appears  most  appropriate  and  define 

^(match(A/))(r/.i,z/,2)  =  cop,(^(A,)(2,-.i),  ^(A,)(zi.2))  (7.8) 

for  all  zijeD^f,  j=  1,  2;/=  1,  ..,m.  The  purpose  here  is  to  describe  the  joint  distributional  value  of  r,.) 
and  zi,2,  not  just  the  “distance”  (actual,  statistical,  or  possibilistic),  and  it  appears  equation  (7.8)  captures 
this  appropriately. 


20 


For  example,  returning  to  equation  (7.1),  note  that  (omitting  subscript  commas  when  unambiguous) 
for  all  zii,zi2  €D\  CR^,  by  use  of  Schweizer’s  inequality  and  eigenvalue  properties,  it  easily  follows  that 
equation  (7.2)  yields  for  copi  =prod,  for  example. 


^(match(geo))(zi,,z,2)  =0(geo(.'’n))  •  0(geo)(zi2)) 
=  rel(z,,Z2)  *  abs(z,,i2)  . 


(7.9) 


where 

rel(z„Z2) 


abs(z„Z2)  = 


2  exp 


hn-Hx  r 

maxeig(Ai) 


jziz-iUi 

mineig(Ai) 


for  all 


(7.10) 


(7.11) 


i  “  -12  fo  • 


(7.12) 


Hence 


( 


lim 

||2i2-/^1  !-►  +  '» 
Jzii-Ziajs  Co 


)' 


(  abs(zi,22))  I  0 


(7.13) 


all 


Mu-A*!  I  >  (maxeig(Ai)/mineig(Ai)  •  Co  - 


(7.14) 


Thus, 


( 


lim 

|ziz-/*iH  +  “ 

I  2li  -2i2  Co 


0(match(geo)(zi,,Zi2)) 

) 


=  0, 


(7.15) 


showing  Zii  and  z,2  being  close  to  each  other  is  not  enough  for  matching  relative  to  the  parent  attribute. 
Note  the  following  two  important  opposite  limiting  deterministic  subcases; 

(i)  Ai  -►  0,  i.e.,  maxeig  (Aj)  -*  0.  This  is  equivalent  to 


0(geo2)(z)  =  (Kfonecker  delta)  , 


(7.16) 
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for  all  2€  R2,  in  which  case,  for  any  copula 


0(match(geo2))(zi,Z2)  = 


Zl  =2z=/il 
z,,22  are  otherwise 


(7.17) 


for  all  Zi,Z2  €  R2. 

This  case  is  not  satisfactory,  since  the  above  indicates  that  one  knows  absolutely  that,  from  a 
prior  viewpoint,  the  target  was  at  fi  and  nowhere  else  I 

(ii)  Ai  ae,  i.e.,  mineig  (AO 

This  is  equivalent  to,  regardless  of  copula  chosen. 


0(match(geo2))(zi,22)  =copi(l.  1)  =  1  , 


(7.18) 


meaning  that  for  all  2^,22  in  the  truncated  region  of  R^,  the  basic  matching  level  is  indifferent, 
i.e.,  all  are  equally  likelyl 

Similarly,  for  Aj  in  equation  (7.3)  with  membership  function  described  in  equation  (7.4),  while 
BR(own)  and  BL(ack)  are  colors  “close  to"  each  other,  their  relations  to  A2  are  both  seen  to  be  small  and 
hence  BR  and  BL  should  not  have  a  good  match.  Indeed,  the  maximal  possible  copula  min  shows 


0(match(A2))(BR.Bf.)  =  min(0.2. 0.1)=  0.1  , 


(7.19) 


etc. 


In  turn,  the  conditional  basic  matching  attribute  membership  functions  are  of  the  form,  for  any 
observed  data  set 


y  =  (yi.t. ><,2)1.1. ...m  e  1'  =  X(Dyt.  X  Da)  . 

I  »  1 

yijeDA^.  ;=  1,2;  i  =  1,  ...m  . 
0(match(Aj))(Ziiyi) 


(7.20) 


=  0(match(A,))(z,-,,z,2|yn,yi2)  (7  2 1) 

=  copi(0(A,)(z„|y»i).0(Ai)(z,2|yi2))  . 

If  one  knows  the  antecedent  function  g  in  equation  (6.7)— not  often  so  in  practice— 0('^i)(zijiyi7)  can  be 
obtained  as  a  conditional  fuzzy-set  membership  function.  On  the  other  hand,  the  function  may  be  obtain¬ 
able  directly  from  experts  in  tabular  form  tpi^i)i^ij\yij)  versus  Zi;  and  ytj .  For  simplicity,  one  could  assume 
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^(A/OCzyby)  =y)ii(piAi){zij);yij)  , 


(7.22) 


for  some  function  V'l  of  two  variables  obtained  empirically.  (If  =cop/(f,  /(s))/  /(s),  essentially 

equation  (6.1)  is  obtained  back.) 
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8.  DEVELOPMENT  OF  MODIFIED  PACT:  INFERENCE 

RULES  AND  ERRORS 


Next,  suppose  a  collection  of  inference  rules  has  been  culled  from  experts,  connecting  correlation 
variable  x  indirectly  with  observed  data  y,  through  auxiliary  variables  z:  Specifically,  let  the  collection  of 
corresponding  conditional  membership  functions  be  denoted  as,  for  any  possible  true  values 


V  j  m 

z  =  (Zjl>  Z/2)(.  1,, .  ,0)  c  =  X  X  D^.  ), 

i  =  I 


2ij  €Da.,  ;=1,2:  /=l,...m. 


{rule,(jt,z)  ;  ii:  =  1,  ..,r}  ,  is  defined  by 


rule*  2  rule*(x|z)=  (cons* (x)| ant* (z)) 


cons*(x)=  0(mod;^  ^)(0(Ao)(x)),  xe  [0, 1} 


ant*(z)  =  comb*(<t.  or,  not)  ('WtCz))  . 


where 

"i*(z)  =(0(mody^  p(^(match(A/))(zi))),>y^  .  (8.5) 

Zi  =(zi,,z/2),  (8.6) 

Here,  comb(<ft,  or,  not)  is  some  well-defined  logical  combination  of  a  copula  cop(*)(0,  1]^  -►  |0,  1) , 
cocopula  cocop(*)(0,  1]^  -•  (0,  1]  ,  and  negation  1  ~  (  )  [0.  1])  applied  to  the  modifications 

mody^  .  of  matches.  Also,  depending  on  =  1.  ..r  . 

y*./e{1....7}.  1  =  0.  (8.7) 

and  index  set  ./*  of  attributes  satisfies 


0  »*  /*  £ 
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For  example,  one  might  have  the  following  three  rules,  given  also  in  symbolic  exponential  forms; 


rulei  **  “If  Ai  matches  very  high,  but  Aj  matches  low, 

then  correlation  level  is  moderately  high"  (8.9) 

**  <L  Aj*)  . 

where 

Ji  =  {l,2};  j\o  =  3',  y’li  =  1;  711  =  6; 

rulei  ♦♦  “If  Ai  does  not  match  high,  or  if  Aj 

matches  moderately  high  then  correlation 

level  is  low  medium" 

-«•  (a“*|(1-a“»  or  Aj^)  , 

where 

720  =  5;  721  =  1;  723  =  3  ; 

rules  **  “If  Ai  matches  high  and  Ai  matches 
mediumly  and  A3  matches  very  low, 

then  correlation  level  is  low" 

♦♦  {A“*|i^*  A  A^*  A  a“^)  , 

where 

73  =  {1,2, 3};  730  =  6;  731  =  2;  Jn  =  4;  733  =  7.  (8.14) 


(8.12) 


(8.13) 


(8.10) 


(8.11) 


The  it'*  inference  rule  at  values  x  and  z  states  basically  “if  the  logical  combination  of  modifiers  of 
matches  ant*(z)  holds,  then,  the  correlation  level  cons*(jc)  holds"  or  equivalently  “the  degree  of 
compatibility  of  (x,z)  relative  to  the  conditional  form  (consjt|ant/t). 

Finally,  with  all  of  the  above  established,  equation  (6.7)  can  be  utilized  as  follows:  For  each  inference 
rule  *,  <:  =  1, ..,  r,  let 

/(x)  =  cons*(x)  ,  (8.15) 


as  in  equation  (8.3); 

}  (y,  2)  =  ant*(z)  , 


(8.16) 
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as  in  equations  (8.4)  through  (8.6),  and  let 


(A(z)|g()’))  =  ant*(z|>)  =‘^(ant*(z)|y) 

=  ant* (z) (with  z  replaced  by  zi>)  (8.17) 

d 

=  comb*(<t,or,no/)(ni*(z|y))  . 


where 


'«*(z|y)  =  (0(modyj^  p[0(match(Ai)(z,|y/)l)„y^  (g  jg) 

a  conditional  error  form  for  the  A**  inference  rule  antecedent. 

Substituting  equations  (8.15)  through  (8.18)  into  equation  (6.7),  using  equation  (8.2),  yields  for  all 
ace[0,  1],  the  posterior  form 

cons*(z|y)£(cons,(jc)|y)  =  max  (rule*(x|z)  •  antt(zly))  .  (8.19) 

ztD  ' 


where  zeD  means  (zjj,z/2f i=  l,..,m)  . 

If  the  simplifying  form  in  equation  (7.22)  is  assumed,  then  equation  (8.19)  also  simplifies,  by 
substituting 

y=  1.2;  is  l....,m  .  (8.20) 


and  letting 


L  ~  (fiit  ti2)i,i,,.. 


(8.21) 


One  obtains  a  reasonable  upper-bound  approximation  (since  range  (^(A,))  C  [0,  IJ  in  general): 


cons*(x|y)  S  cons*(x|y)o 

=  max  ((cons*(x)|a*^))  •  a *(v>* (/,>)))  , 

1»[0. 11*'" 

where 

OQc(t)  =  combkiA,  or.  not) .  (8.23) 

r}k  =  (0(mody^  p{copi(ti,,  rij)))/,^^  .  (8.24) 

Okiipkd.y))  =  combk(A,or.not)(ifik(L>y))  .  (8-25) 

'Pkd^y)  -  (^(mod;^picop,(V'i(/,i,y,i).  V'i(^2.>’/2))l)i.y*  (8.26) 
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In  turn,  the  r  posterior  forms  coni*(X|>'),  t  =  1,  ...r,  must  be  combined  into  one  overall  posterior. 

By  the  diagonalization  technique  (reference  14),  one  can  naturally  combine  the  separate  posteriors  in 
equation  8.19  as  the  solution,  cons{X\y),  of  the  equation 

cop(cons(x|y),cons(diag|y))  =conso(;c|y)  , 

where 

conso(x|y)  =  cop((cons*(x)|y))*,,....r)  , 
cons(diagjy)  =  max  (conso(x|y))  • 

Jt»10.  l] 


(8.27) 


(8.28) 
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9.  SUMMARY  OF  INPUTS  AND  COMPUTATIONS 


For  both  the  full-modified  PACT  form  in  equation  (8.19)  and  the  shortened  upper-bound  form  in 
equation  (8.22),  the  following  basic  input  steps  are  required: 

1.  Determine  the  master  list  of  auxiliary  attributes  Ai,..,Am  and  obtain  observed/updated/ 

reported  data  c  D^.  for  target  j  relative  to  A,,  ;  =  1,  2.  /  =  1, ...  m. 

2.  Obtain  the  table  of  appropriate  values  of  exponents  at,  /=  1,  ...7  in  equations  (7.5)  and 
(7.7)  for  interpreting  modifiers  mod/,  /=  1,  ..,7. 

3.  Provide  the  master  list  of  inference  rules  rulei, ...,  rule/  from  experts  in  syntactic  form.  The 
syntactic  structure  is  given  as 

rule*  ♦♦  {cj  ;  combki,&.,or.not){{aj.  .)uj.  ) 

V - ^ 

conseq.  anttcedent 

exponent  exponent  combination 

for  some  0  ^  Jt  {1.  m},  ik  =  1, r. 

If,  in  particular,  comb*(<S:.o/',noO  is  strictly  cop(*),  then  one  can  write 
rul«*'^(«y*.o:  («;•*., ).w^). 

4.  Determine  for  each  attribute  A/,  the  membership  function  0(A,)  -♦[  0,1],  /  =  0, 

1,  2,  ..,m. 

5.  Determine  the  appropriate  forms  for:  copula  cop„  corresponding  to  match  ('4i),  /=1,  ..,  m; 
copula  cop*  and  cocopula  cocop*  for  rule* ,  k  =  1, ..,  r ,  and  copula  cop  for  use  in  combining 
the  PACT  outputs  for  each  inference  rule. 

For  only  the  full-modified  PACT  form  in  equation  (8.19),  the  following  input  is  required: 

6.  Determine,  directly  via  experts,  for  each  attribute  Aj ,  the  conditional  membership  function 

bi)  :  Da.  -*(  0,  IJ,  for  all  yic  Da^,  i  =  1.  .,  m 

For  only  the  upper-bound  form  for  PACT  in  equation  (8.22),  the  following  input  is  needed: 

7.  Determine,  directly  via  experts,  for  each  attribute  A,  ,  the  function  V’l  relating  0(A,)(z/y)  and 
yij  in  the  conditional  form  given  in  equation  (7.22),  j=l,  ..,  m.  Thus,  utilizing  ^(Aj)  and 
observation  yij,  one  wishes  to  obtain  the  transfer  relation 


when  yij  is  observed,  ;'=1,2;  /=1,  ..,  m. 

For  the  full-modified  PACT  form  in  equation  (8.19),  table  9-1  provides  a  summary  of  the  required 
computations,  with  figure  9-1  giving  the  corresponding  full  flowcharts. 
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Table  9-1.  Summary  of  computations  determining  PACT  relative  to 
the  i'*  given  inference  rule,  in  its  full-modified  form. 


Compute  for  all  z,i ,  Da.  , 

and  all  xe  [0, 1],  with  yn.yn  cDa.  given,  i=  1.  ...m  : 
0(match(A,))(z,)  =  cop,(0(Ai)(z„),^(Ai)(z/2))  , 
tn*(z)  =  (^(mody^  p[0(match(A/))(z,)l)i,j-^  . 
antt(z)  =  comb*  (4,  or,  not)  (m*(z))  , 

cons*(;t)  =0(mod/^  ^)(0(Ao)(z))  , 

rule*(x|z)  =  (cons*(x)|ant*(z)) 

=  cop(*) (cons* (x),  ant* (z))/ant*(z)  , 

provided  ant*(z)  >  0  . 

0(match(A/))(z/|y,)  =  cop,(0(A,)(z/i|y,-,),<J(Aj)(z,-2lyi2))  , 
=  (0(niod;^  pi0(match(A,)(z,|y,)J)/,y^ 
ant*(z|y)  =comb*(4,or,no()(m*(z|y))  , 

cons*(x|y)  =  max  (rule* (x|z)  •  ant*(z|y))  . 

ZtD 


(7.8) 

(8.5) 

(8.4) 

(8.3) 

(8.2) 

(6.1) 

(7.21) 

(8.18) 

(8.17) 

(8.19) 


BEGIN  STEP  1: 

DETERMINE  A,.. ..Am  AND 


OBSERVE  DATA 


yij  €  Da^.  j  =  i,2, 


STEP  2: 
DETERMINE 
TABLE  OF 
EXPONENTS 
a,  FOR 
(Mod,) 


STEP  3: 
CHOOSE 
MASTER  LIST 
OF  INFERENCE 
RULES  ruUk  , 
k=  I, ...  r 


STEP  4. 
DETERMINE 
MEMBERSHIP 
FUNCTION 
<P(Ai):  DAr*[0.  11 
/  =  0, 1,  ...m 


STEP  5: 
PICK 
copi  FOR 
match(Ai)-, 
copik) . 

coco/j(*)  FOI 
rulejc 


STEP  6: 

DETERMINE 

CONDITIONAL 

MEMBERSHIP 

FUNCTION 

4>(Ai)(-  bi)  : 

i  =  1,  ...m 


PICK 

ruUk 


EQ  (8.4),  (8.3): 

COMPUTE  FOR  ALL  Jtc  (0,  1], 
2€D  , 

antkiz),  conskix) 


EQ  (8.2),  (6.1): 

COMPUTE  FOR  ALL  X€[0, 1] 
zeD  , 

ruUkixJz)  =  (conSk(x)\antk(z)) 


EQ  (7.8): 
COMPUTE 
^(ma{ch(Ai)(zi) , 
ALL  z,  €  Da  ^  Da^  , 
/* 


EQ  (8.17): 
COMPUTE  FOR 
ALL  rc  D, 
antk(z\y) 


EQ  (7.2): 
COMPUTE 
<p{match(Ai)(zi\yi) , 
ALL  ZifDA-  X  Da.  , 
i  =  1, ...  m 


EQ  (8.19): 

COMPUTE  FOR  ALL  JC€(0.  1] 
co/w*(j:|)>) 


Figure  9-1.  Flowchart  of  computations  for  determining  full-modified 
PACT  form  relative  to  the*'*  inference  rule. 
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For  the  shortened  upper-bound  form  for  PACT  in  equation  (8.22),  table  9-2  provides  a  summary  of 
the  required  computations,  with  figure  9-2  giving  the  corresponding  full  flowcharts. 

Table  9-2.  Summary  of  computations  determining  the  shortened 
upper-bound  form  for  the  given  inference  rule. 


Compute  for  all /ii,t  12,  jcc[0,  1],  f=l,..,m  : 

nki!)  =  (0(mod;j^  .)[cop,(/ii,ri2)])i,y^  . 

(8.24) 

OftCf)  =  combk(&,  or.  not)  (nk(i))  , 

(8.23) 

conSit(Jc)  =^(mody^  ^){^(Ao)(x))  . 

(8.3) 

(const(x)|Q(*(/))  =cop(*)(cons*{x),o(*(t))/a*(t)  . 

(6.1) 

provided  o*(t)  >  0  , 

=  (0(mody^picopi(V',(t,i.y/,),  V'/(ti2,y,2))]),>y^  . 

(8.26) 

otk(%(l.y))  =combki&.or,not)(^k(L>y))  . 

(8.25) 

cons*(x|y)o=  max  (cons*(x)|  «*(/)).  a*  (V-kC/.y))) 

If  10.  ip'” 

(8.22) 

For  both  the  full-modified  and  shortened  versions  of  PACT,  table  9-3  provides  a  summary  of  the 
required  computations  needed  to  combine  the  separate  outputs  of  PACT  relative  to  each  inference  rule. 

Table  9-3.  Summary  of  computations  determining  the  shortened 
upper-bound  form  for  each  inference  rule. 


Compute  for  all  x€  (0,  l],y  eD  given  (i.e.,  yi,i.yi,z  €Da.  given,  i  =  1. ...  m)  ; 

conso(x|y)  =  cop((cons*(jcly))*=i.,..r)  , 

(8.28) 

cons(diagly)  =  max  (conSo(x|y))  : 

Xf[o.i] 

(8.29) 

Solve  for  cons(x|y)  ; 

cop(cons(x|y),cons(diag|y))  =conso(x|y)  . 

(8.27) 
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10.  AN  ILLUSTRATIVE  EXAMPLE 


m  =  3  auxiliary  attributes: 


(10.1) 


=  geoi,  Aj  =  color.  As  =  radari  , 


(10.2) 


Modifier  exponents: 

ai=4,  0!2  =  3,  03  =  2,  04  =  1,  05  =  0.75,  04  =  0.5,  0  7  =  0.25. 

cop,  =  prod,  cocop,'  =  probsum,  /  =  1, 2, 3  ; 
cop  =  min,  <^{not)  =  1  -  (  )  . 


(10.3) 


(10.4) 


r  =  3  inference  rules:  rulei,  rule;,  rules,  as  given  in  equations  (8.9)  through  (8.14),  with  logical 
operators 

cop(i)  =  cop(3)  =  prod,  cop(2)  =  min  .  (10.5) 


For  Ay,  Da.~  R2,  ;=  1,3,  with 

fij€  r2.  Ay  2  by  2  positive  definite  matrbt  , 


ip(Aj(zj)  =Xjie  J  J  •'  , 


(10.6) 


For  all  rye  R2, y=/,J  ; 

For  A2,  =  [YEL,  OR,  BL,  WH,  BR}  with 

^(A;)  :  Daj  -*  10,  IJ  given  by 

^(A2)(r£L)  =  0.6,  <p{Ai){OR)  =  0.5,  ip{A2){BL)  =  0.1  , 
ipiAiiWH)  =  0.9,  4>iA2)(BR)  =  0.2  . 

To  determine  ipi  (equation  7.22)  for  i=l,3: 

First,  recall  the  following  (see,  for  example,  Anderson,  reference  15,  chapter  1): 

Let  random  variables  z,  ifci  by  1,  y,  it;  by  1,  be  jointly  disuibuted  A),  where 


fi  = 


,  A  = 


(10.8) 


/t,  ki  +  k2  by  1,  with  fii ,  k;  by  1  constant  (mean).  A,  *,  +*;  by  *1+^2  positive  definite  with  A, ,  ki  by  *, 
positive  definite  and  A,; ,  kx  by  *2  constants,  <=1,2.  Then,  the  conditional  random  variable  (z|y)  is  distrib¬ 
uted  Nn^(E{z\y),Cov{z\y)),  where 
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E[z\y)  =^i  +  An  •  Aj*  •  (y-fit)  , 


(10.9) 


Cov(z\y)  =  A]  -  AijAi^Afz  . 

Next,  suppose  the  simple  additive  linear  regression  relation  holds 


(10.10) 


y  =z  =  w  , 


(10.11) 


where  z  and  w  are  statistically  independent  with  z  distributed  Ai)  (Aj  pos.  del.)  and  w  distributed 

A*^(M3.A3)(A3  pos.  def.). 

Then  is  jointly  distributed  and  equations  (10.8)  through  (10.10)  hold,  where  now 


Ai2  — Aj,  A2  =  Ai+A3  ,  (10.12) 

(10.13) 

whence 

£^(2|>')  =  +Ai(Ai +A3)*‘(y-;zi -/t3)  .  (10.14) 

Cov(2\y)  =  Ai  -  Ai(Ai  +  A3)"'Ai  =  (Aj’  +A3*  )**  .  (10.15) 


It  follows  that  for  the  normalized  version  of  Gaussian  distributions,  letting  An  =Ai2  =  1  in  equation  7.2, 
one  has  for  its  possibility/probability  function  counterpart 


P{z\y)  =  e  -  z  eR2, 


(10.16) 


where  by  expanding 

Qi^.y)  =  (2  - Eiz\y))‘^Cov-^iz\y)(z  - E(z\y))  -  (z-m)'^'  (Aj’  +  Aj*)"'  •  (z-/ii) 
+  (0 -Ml -Mi)^  •  R  •  (y -Ml  -M3))  -  2S  . 


where 


R  =  (Ai  +  A3)''Ai(Ai' +  A3')Ai(Ai  +  A3)'* 

=  Ai*  •  (Ai*  +  Ai*)-*  •  (AT‘  +  Ai*)(Ai*  +  Aj*)-'  •  Ai* 

=  Ai*  •  (Al*  +  Ai*)-*  •  Ai*  ,  (10.18) 
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and 


S  =  O'  -/Ml  -^3)^  •  (Ai  +  As)"'  •  Ai  •  (Aj'  +  j 

-  (y  -  fii  -  ■  (Ai+As)"'  •  (A1  +  A3)  • 

=  0-/M1  -/M3)^  •  Aj^  ■  (z-/Ui) 

-  i 

=  0  -/“i  •  Aj  ^  •  0(z)  •  y(z) 

where 

^  ^3^  ‘  (^“^1)  • 

yO)  =  S  A3  ^  •  0  -/Ml)  J 

=  ((2-/ii)^A3‘(z-/i,))^  • 

Note  that  by  its  very  definition, 

H  6(2)  1.  all  2  ^  fly  . 

Next,  make  the  conjugate-like  assumption  that 
Aj  =  K  •  A3  , 

for  some  real  positive  constant  k  to  be  determined. 
Denote  the  prior  function  for  z  as 

P{z)  =  z  € R  *«’ , 


•  (z-/Mi) 

3'  •  (2-A1) 

(10.19) 

(10.20) 

(10.21) 

(10.22) 

(10.23) 

(10.24) 


corresponding  to  the  normalized  version  of  z  being  distributed  A*pCmi,  Ai)  as  above.  We  wish  to  solve  for 
P(zly)  in  terms  of  F(z)  and  y—  and  hence,  ^  la  equation  7.22  -  determine  the  most  appropriate  function 
V’i  (and  gi)  ; 

Leaving  ff(z)  alone  temporarily,  clearly  applying  equation  (10.23)  to  equation  (10.21)  yields 


y(z)  =k2  •  ((z-^,)^Al‘(2-;<i))* 
=  /cI-  (-2logF(z))J  . 


(10.25) 


Using  equation  (10.23),  equation  (10.18)  becomes 


F  =  (k/k  +  1))  ■  A3'  =  {k^/(k  +  1))  ’  A7' 


(10.26) 
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and 


Ar‘  +  Aj'  =  (!+«:)  *  Al’  , 

(10.27) 

from  which 

(2  -//i)’"(Al'  +  Aj’)(z  -/ii) 

=  (l  +  #c)(z-/2i)^AT*(z-/i,)  . 

whence 

(10.28) 

Next,  replacing  8(2)  by  the  approximating  consunt 

60  =  0(2),  with  2  replaced  by  E(y)  =  fii  +  , 

(10.29) 

yields 

^  1  -i 

00  *  (l/OzfAjVz)^  •  Aj^  •  /fj  e  R  *<> 

(10.30) 

Then,  gathering  together  equations  (10.25)  through  (10.30)  shows  that  under  the  assumptions  in 
equations  (10.23)  and  (10.29),  by  substitution  into  equation  (10.17),  equation  (10.16)  becomes 

PiAy)  * 

=  A(y)  •B(/>(2))  •  0(^(2), y)  , 

(10.31) 

where 

(10.32) 

S(/>(2))=/’(2)'«, 

(10.33) 

.'1  1 

C{PU),y)  = 

(10.34) 

Main  Aoplication:  Let  *0  =  2,  P{2)  =  0(A,)(z;),  y  =  1,  3  . 
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11.  SUMMARY  AND  CONCLUSIONS 


This  technical  document  is  a  followup  to  the  basic  development  of  the  PACT  algorithm  for  combining 
linguistic-based  and  probabilistic  information  in  track-data  association  or  correlation.  While  the  same 
spirit  of  PACT  is  followed  (see  reference  1  for  background),  a  number  of  modifications  are  made  in  order 
to  reduce  the  nonsystematic  or  ad  hoc  aspects.  Those  include:  modeling  of  implication  in  PACT’S 
inference  rules  by  using  conditional  fuzzy  sets  or  their  appropriate  approximations,  rather  than  by- 
fuzzification  of  material  implication,  as  in  the  previous  PACT  form;  utilization  of  one  inference  rule  at  a 
time  to  obtain  the  basic  PACT  posterior  form  for  correlation,  followed  by  a  combining  procedure 
(diagonalization  [reference  14]),  making  use,  when  feasible,  of  a  fuzzy-set  extension  of  the  author's 
previously  established  conditional-event  algebra  (reference  13);  choosing  copulas  (rather  than  possible 
noncopula  t-norms  as  in  the  previous  version  of  PACT/and  cocopulas  for  all  logical  connectors,  based 
upon  random-set  considerations,  outlined  in  Chapter  5). 

Finally,  a  new  shortened  upper-bound  approximation  to  the  basic  PACT  output  is  exhibited,  replacing 
the  iterated  disjunction  operation  over  all  auxiliary  attribute  domains  by  the  more  standardized  domain  of 
the  unit  interval,  appropriately  replicated.  Based  on  the  summary  of  inputs  and  computations  (Chapter  9) 
for  both  the  full-modified  version  of  PACT  and  the  shortened  upper-bound  one,  it  follows  readily  by 
inspection  (especially  of  tables  9-1  and  9-2)  that 


running  time  for  shortened  version  of  PACT 
running  time  for  full  modified  version  of  PACT 


where,  as  usual,  m  is  the  number  of  attributes,  card(DA^)  (assuming  all  attribute  domains  are  finite  or 
made  finite)  is  the  cardinality  of  D^.,  /=l,..,m,  and,  finally,  n+1  is  the  number  of  elements  in 
{0,1,1,  ...ill,  1}  .  the  n'**  discretized  version  of  unit  interval  [0,1]. 

n  n  n 

Naturally,  a  tradeoff  exists  between  the  running  time  of  the  shortened  version  of  PACT  for  the  n'* 
discretized  version  of  [0,1]  versus  implementational  fidelity  versus  accuracy  relative  to  the  full-modified 
version  of  PACT.  Future  numerical  experiments  will  hopefully  address  this  issue.  Such  experiments  also 
are  planned  for  testing  the  robustness  and  improvement  in  accuracy  for  the  new  versions  of  PACT  relative 
to  use— or  lack  of  use— of  linguistic-based  attributes,  for  fixed  geolocation  and  other  statistical  attribute 
intormation. 


39 


12.  REFERENCES 


1.  Goodman,  I.  R.  1986.  “PACT:  An  Approach  to  Combining  Linguistic-Based  and  Probabilistic  Infor¬ 
mation  in  Correlation  and  Tracking.”  NOSC  TD  878  (March).  Naval  Ocean  Systems  Center,  San 
Diego,  CA. 

2.  Goodman,  I.  R.,  H.  L.  Wiener,  and  W.  W.  Willman.  1980.  “Naval  Ocean-Surveillance  Correlation 
Handbook,  1979.”  NRL  Report  8402.  Naval  Research  Laboratory,  Washington,  DC.  September  17, 
1980.  (Unclassified,  but  limited  distribution) 

3.  Wiener,  H.  L.,  W.  W.  Willman,  I.  R.  Goodman,  and  J.  H.  Kullback,  “Naval  Ocean  Surveillance 
Correlation  Handbook,  1978."  NRL  Report  8340.  Naval  Research  Laboratory,  Washington,  DC. 
October  31,  1979.  (Unclassified,  but  limited  distribution) 

4.  Larsen,  R.  W.,  and  R.  A.  Dillard.  1989.  “A  Fuzzy  Logic  Decision  Support  Tool.”  NOSC  TR  1286 
(October).  Naval  Ocean  Systems  Center,  San  Diego,  CA. 

5.  Eisen,  M.  1969.  Introduction  to  Mathematical  Probability  Theory.  Prentice-Hall,  Inc.,  Englewood 

cuffs,  NJ. 

6.  Dubois,  D.,  and  H.  Prade.  1980.  Fuizy  Sets  and  Systems.  Academic  Press. 

7.  Goodman,  1.  R.,  and  H.  T.  Nguyen.  1985.  Uncertainty  Models  for  Knowledge-Based  Systems.  North- 
Holland  Press,  Amsterdam. 

8.  Goodman,  I.  R.  1980.  “Identification  of  Fuzzy  Sets  With  a  Class  of  Canonically  Induced  Random 
Sets.”  Proceedings  of  the  I9th  IEEE  Conference  on  Decision  and  Control,  pp.  352-357. 

9.  Goodman,  I.  R.  1984.  “Some  New  Results  Concerning  Random  Sets  and  Fuzzy  Sets.”  Inf  Sci  34, 
pp.  93-113. 

10.  Goodman,  I.  R.  “Algebraic  and  Probabilistic  Bases  for  Fuzzy  Sets  and  the  Development  of  Fuzzy 
Conditioning,”  in  the  book,  Conditional  Logic  in  Expert  Systems  (I.  R.  Goodman,  M.  M.  Gupta,  H. 
T.  Nguyen,  G,  S.  Rogers,  Ed.),  North-Holland  Press,  Amsterdam  (1991),  pp.  1-69. 

11.  Schweizer,  B.,  and  A.  Sklar.  1983.  Probabilistic  Metric  Spaces.  North-Holland  Press,  NY. 

12.  Goodman,  I.  R.  1990.  “Three-Valued  Logics  and  Conditional  Event  Algebras.”  Proceedings  of  the 
First  International  Symposium  on  Uncertainty  Models  and  Applications,  University  of  Maryland, 
College  Park,  Maryland,  December  1990,  pp,  31-37. 

13.  Goodman,  I.  R.,  H.  T.  Nguyen,  and  E.  A.  Walker.  1991.  Conditional  Inference  and  Logic  for 
Intelligent  Systems:  A  Theory  of  Measure-Free  Conditioning.  North-Holland  Press,  Amsterdam. 

14.  Goodman,  I.  R.  1989.  “Applications  of  a  Conditional  Event  Algebra  to  Data  Fusion;  Part  2.”  Pro¬ 
ceedings  of  the  Third  Tri-Service  Data  Fusion  Symposium.  Applied  Physics  Laboratory,  Johns  Hop¬ 
kins  University,  Laurel,  Maryland,  May  1989,  pp.  181-193. 

15.  Anderson,  T.  W.  1984.  Introduction  to  Multivariate  Analysis.  Second  Edition,  John  Wiley  &  Sons, 
NY. 


41 


REPORT  DOCUMENTATION  PAGE 


Form  Approved 
0MB  No  0704'0m 


PubUc  rapoiting  buFOen  for  tbK  coliocoon  of  informailOA  is  ostlmatod  to  tvorago  1  hour  por  rosponso.  tnctuding  m«  timo  tor  rv^/lowtng  insmictiont  soarcning  oxistlAS  oau  souroos.  oathsnng  «nd 
maintaifting  tht  data  naodod.  and  oompMlr>g  and  revtawtog  the  ooitoction  at  informaiton.  Sand  commar>ts  raganamg  this  burdan  astimaie  or  any  atr>af  aspaci  or  this  oottection  or  intormatton  inciudtno 
tuggaattactttof  caducing this  bucdan. to Washtogton  Haadquartars Sarvioaa.  Oiractorata tor  Intormatton Oparattons  and  ftapocts.  121SJartanor\Ua«HH»gtiMray.  S^matZOS.  Anington.VA  222Q2-A302 
andfotna  omca  of  Managaihant  ar>d  Budgat.  PaparworK  Raductlon  Pfc^act  {0704-0166).  Washingion.  DC  20603 


2.  REPORT  DATE  3  REPORT  TYPE  AND  DATES  COVERED 


1.  AGENCY  USE  ONLY  ^Mva  MvNd 


4  .  ms  AND  suermE 


July  1992 


A  REVISED  APPROACH  TO  COMBINING  LINGUISTIC  AND  PROBABILISTIC 
INFORMATION  IN  CORRELATION 


6.  AUT>ICIR(S) 

Dr.  1.  R.  Goodman 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  AOORESS(ES) 

Naval  Ocean  Systems  Center 
San  Diego,  CA  92152-SOOO 


S  SPONSORING/MONTTORING  agency  NAME(S)  and  A00RESS<ES) 

Chief  of  Naval  Research 
OCNR 

Arlington,  VA  22217-5000 


Final:  FY  90  -  FY  91 


5  FUNDING  NUMBERS 

PE:  0602936N 
PR:  RV36121 
WU:  DN300036 


a  PERFORMING  organization 
REPORT  NUMBER 

NOSC  TR  1386 


10  SPONSORING/MONfTORING 
AGENCY  REPORT  NUMBER 


12a.  DISTRIBUnON/AVAiLABlLJTy  STATEMENT 

Approved  for  public  release;  distribution  is  unlimited. 


12t>  DiSTRIBUnON  COOE 


13.  ABSTRACT  (Maximum  200  woMa) 

With  the  advent  of  the  field  of  Artincial  Intelligence  (Al),  new  inaighta  have  been  gained  into  infoimation  handling  in  general,  and 
target  data  aaaociation,  in  particular.  It  ia  now  clear  that  the  traditional  uae  of  only  ato^aatic  geolocation  information  (i.e.,  vector  poaitiona, 
velocitiea,  etc.)  ia  not  really  adequate  to  describe  the  full  data  association  problem.  Rather,  the  situation  can  be  markedly  improved  by  using 
previously  diagarded  information  in  the  form  of  linguistic  descriptions  or  narratives,  such  as  ‘probably  very  long,*  ‘appears  to  be  flying  a 
dark-blue  flag,”  ‘is  rather  oblong  in  appearance,”  etc.  However,  at  present,  despite  the  inroads  made  with  AI  techniques,  there  is  no  adequate 
comprehensive  theory  covering  the  complete  modeling  of  such  information  in  coi\iunetion  with  the  classical  measurement  information  of 
geolocation. 

A  previous  attempt  at  addressing  the  above  issue  was  formalized  through  the  development  of  the  PACT  (Possibilistic  Approach  to 
Correlation  and  Tracking)  algorithm  developed  at  NOSC.  The  novelty  of  Uie  algorithm  was  its  explicit  use  of  both  linguistic-based  evidence 
and  probabilistic  information  throu^dr  a  common  structure  of  a  possibilisticlfuzzy-set  model.  The  algorithm  is  essentially  a  generalization  of 
a  conditional  form  of  the  well-known  total  probability  expansion  theorem,  an  alternative  to  Bayes’  theorem,  when  priors  are  not  readily 
determinated;  and  a  number  of  auxiliary  attributes  must  be  utilized  to  connect  the  parameter  of  interest— here,  data  association  or 
correlation— with  the  observed  data.  Specifically,  PACT  consists  of  a  disjunction  over  all  attribute  arguments  of  a  coi\junction  of  conditional 
fuzzy-set  membership  functions,  one  representing  inference  rules  connecting  data  association  with  combinations  of  matching  of  attributes, 
the  other  representing  conditional  errors.  A  typical  inference  rule  is  the  form  ”if  geolocation  values  match  moderately,  but  visual  descriptions 
match  highly,  then  the  correlation  level  between  the  targets  should  be  very  high.” 

While  numerical  experiments  indicated  PACT  improving  upon  the  traditional  approach  of  using  only  geolocational  information,  serious 
questions  have  arisen  concerning  the  ad  hoc  nature  of  the  choice  of  logical  operators,  the  modeling  of  the  conditional  fuzzy-set  membership 
Unctions,  and  last— but,  not  least— the  overall  running  times  required  to  implement  the  algorithm. 

The  current  work  seeks  to  remedy  the  above  difflculties  of  PACT  by  re-examining  the  mathematical  and  structural  foundations.  The 
consequence  of  this  is  twofold; 

1.  Conditional  fuzzy-set  membership  functions  should  not  be  identifled  as  fiizzy-set  extensions  of  the  usual  material-implication 
operator  applied  to  appropriate  argument  membership  functions,  but  instead  as  natural  extensions  of  the  author's  newly  developed 
conditional  event  algebra.  This  is  presented  in  detail  in  the  paper  ‘Algebraic  and  probabilistic  bases  for  fuzzy  sets  and  the  development  of 
fiizzy  conditioning”  in  the  book.  Conditional  Logic  in  Expert  Syalema  (I.  R.  Goodman  et  al.,  Ed.,  North-HoUand  Press,  1991).  Roughly 
spesiking,  conditional  events  (a|b),  (c|d),  etc.,  are  the  ‘events  inside  the  conditional  probability  evaluations  p(a|b),  p(c|d),  etc.  These  can  be 
combined  logically  prior  to  any  probability  evaluations  p  to  yield  again  (conditional)  events.  Conditional  fuzzy  sets  are  the  fuzzy-set 
extensions  of  these  entities,  fionsequently,  inference  rules  can  now  be  put  on  a  firmer  mathematical  basis  relative  to  their  interpretation  and 
evaluations.  On  the  other  hand,  for  puiposes  of  expediency,  at  times  it  may  be  more  desirable  to  approximate  such  conditional  fuzzy  sets  by 
simpler  quantities. 

2.  A  great  reduction  in  computation  time  for  implementing  PACT,  due  to  the  di^unction  operation  reiterating  upon 
attribute  domains  mentioned  earlier,  can  be  effected  through  an  appropriate  substitution  and  approximation  technique.  In  effect, 
the  disjunction  can  now  be  restricted,  without  loss  of  generality,  in  an  upper-bound  sense,  to  simplify  replications  of  the  more 
standardized  unit  interval. 


14  SUBJECT  TERMS 

correlation 
data  association 
knowledge-based  ifystems 


17  SECURITY  CLASSIFICATION 
OF  REPORT 

UNCLASSIHED 


fuzzy  sets 

possibility  functions 
inultivalues  logic 


IS  SECURITY  CLASSIFCATION 
OF  THIS  PAGE 

UNCLASSIFIED 


ie  SECURITY  CLASSlFCATICIN 
OF  ABSTRACT 

UNCLASSIFIED 


1$  NUMBER  OF  PAGES 

45 


16  PRICE  COOE 


20  UMOAnON  OF  ABSTRACT 

SAME  AS  REPORT 


NSN  7S4O0t.2SMS00 


SMMdd  tam  2S6 


N8N  784aoi-2aM600 


SUmtaid  term  2M  (BACK) 

UNCLASSIHED 


INITIAL  DISTRIBUTION 


Code  0012 

Patent  Counsel 

(1) 

Code  014 

K.  J.  Campbell 

(1) 

Code  0141 

A.  Gordon 

(1) 

Code  0141 

G.  M.  Dillard 

(1) 

Code  171 

G.  L.  Ruptier 

(1) 

Code  027 

V.  Ware 

(1) 

Code  02712 

Archive/Stock 

(6) 

Code  0274B 

Library 

(2) 

Code  40 

R.  C.  Kolb 

(1) 

Code  402 

J.  H.  Maynard 

(1) 

Code  42 

J.  A.  Salzmann 

(1) 

Code  421 

D.  L.  Conwell 

(1) 

Code  421 

M.  J.  Gherrity 

(1) 

Code  421 

F.  Kramer 

(4) 

Code  421 

I.  R.  Goodman 

(12) 

Code  422 

C.  J.  Funk 

0) 

Code  422 

M.  C.  Mudurian 

(1) 

Code  444 

R.  Larsen 

(1) 

Code  521 

S.  E.  Apitz 

(1) 

Code  721 

J.  Aitkinhead 

(1) 

Code  721 

J.  L.  Kaina 

(1) 

Code  741 

S.  C.  McGirr 

(1) 

Code  782 

M.  J.  Shensa 

(1) 

Defense  Technical  Information  Center 
Alexandria,  VA  22304-6145  (4) 

NCCOSC  Washington  Liaison  Office 
Washington,  DC  20363-5100 

Center  for  Naval  Analyses 
Alexandria,  VA  22302-0268 

Navy  Acquisition,  Research  and  Development 
Information  Center  (NARDIC) 
Washington,  DC  203^1-5000 

GIDEP  Operations  Center 
Corona,  CA  91718-8000 

NCCOSC  Division  Detachment 
Warminster,  PA  18974-5000 


Approved  for  public  release;  distribution  Is  unIMted. 


